Close

1. Identity statement
Reference TypeConference Paper (Conference Proceedings)
Sitesibgrapi.sid.inpe.br
Holder Codeibi 8JMKD3MGPEW34M/46T9EHH
Identifier8JMKD3MGPAW/3MC59RH
Repositorysid.inpe.br/sibgrapi/2016/08.31.17.22
Last Update2016:08.31.17.22.17 (UTC) administrator
Metadata Repositorysid.inpe.br/sibgrapi/2016/08.31.17.22.17
Metadata Last Update2022:05.18.22.21.09 (UTC) administrator
Citation KeyCavalinDornCruz:2016:ClLiEv
TitleClassification of Life Events on Social Media
FormatOn-line
Year2016
Access Date2024, Apr. 29
Number of Files1
Size192 KiB
2. Context
Author1 Cavalin, Paulo
2 Dornelas, Fillipe
3 Cruz, Sergio
Affiliation1 IBM Research
2 IBM Research, Universidade Federal Rural do Rio de Janeiro
3 Universidade Federal Rural do Rio de Janeiro
EditorAliaga, Daniel G.
Davis, Larry S.
Farias, Ricardo C.
Fernandes, Leandro A. F.
Gibson, Stuart J.
Giraldi, Gilson A.
Gois, João Paulo
Maciel, Anderson
Menotti, David
Miranda, Paulo A. V.
Musse, Soraia
Namikawa, Laercio
Pamplona, Mauricio
Papa, João Paulo
Santos, Jefersson dos
Schwartz, William Robson
Thomaz, Carlos E.
e-Mail Addresspcavalin@br.ibm.com
Conference NameConference on Graphics, Patterns and Images, 29 (SIBGRAPI)
Conference LocationSão José dos Campos, SP, Brazil
Date4-7 Oct. 2016
PublisherSociedade Brasileira de Computação
Publisher CityPorto Alegre
Book TitleProceedings
Tertiary TypeIndustry Application Paper
History (UTC)2016-08-31 17:22:17 :: pcavalin@br.ibm.com -> administrator ::
2022-05-18 22:21:09 :: administrator -> :: 2016
3. Content and structure
Is the master or a copy?is the master
Content Stagecompleted
Transferable1
KeywordsSocial Media
Life Events
Classification
Umbalanced datasets
AbstractIn this paper we present an investigation of life event classification on social media networks. Detecting personal mentions about life events, such as travel, birthday, wedding, etc, presents an interesting opportunity to anticipate the offer of products or services, as well to enhance the demographics of a given target population. Nevertheless, life event classification can be seen as an unbalanced classification problem, where the set of posts that actually mention a life event is significantly smaller than those that do not. For this reason, the main goal of this paper is to investigate different types of classifiers, on a experimental protocol based on datasets containing various types of life events in both Portuguese and English languages, and the benefits of over-sampling techniques to improve the accuracy of these classifiers on these sets. The results demonstrate that a Logistic Regression may be a poor choice to deal with the original datasets, but after over-sampling the training set, such classifier is able to outperform by a significant margin other classifiers such as Naive Bayes and Nearest Neighbours, which do not benefit as well from the over-sampled training set in most cases.
Arrangementurlib.net > SDLA > Fonds > SIBGRAPI 2016 > Classification of Life...
doc Directory Contentaccess
source Directory Contentthere are no files
agreement Directory Content
agreement.html 31/08/2016 14:22 1.2 KiB 
4. Conditions of access and use
data URLhttp://urlib.net/ibi/8JMKD3MGPAW/3MC59RH
zipped data URLhttp://urlib.net/zip/8JMKD3MGPAW/3MC59RH
Languageen
Target FileSibgrapiWIA_LifeEvents_2016_cameraready.pdf
User Grouppcavalin@br.ibm.com
Visibilityshown
Update Permissionnot transferred
5. Allied materials
Mirror Repositorysid.inpe.br/banon/2001/03.30.15.38.24
Next Higher Units8JMKD3MGPAW/3M2D4LP
Citing Item Listsid.inpe.br/sibgrapi/2016/07.02.23.50 9
Host Collectionsid.inpe.br/banon/2001/03.30.15.38
6. Notes
Empty Fieldsarchivingpolicy archivist area callnumber contenttype copyholder copyright creatorhistory descriptionlevel dissemination doi edition electronicmailaddress group isbn issn label lineage mark nextedition notes numberofvolumes orcid organization pages parameterlist parentrepositories previousedition previouslowerunit progress project readergroup readpermission resumeid rightsholder schedulinginformation secondarydate secondarykey secondarymark secondarytype serieseditor session shorttitle sponsor subject tertiarymark type url versiontype volume


Close